Scale-invariant clustering with minimum volume ellipsoids

Authors

  • Mahesh Kumar
  • James B. Orlin
Abstract

This paper develops theory and algorithms for a new clustering metric. The metric minimizes the total volume of clusters, where the volume of a cluster is defined as the volume of the minimum volume ellipsoid (MVE) enclosing all data points in the cluster. This metric is scale-invariant, that is, the optimal clusters are invariant under an affine transformation of the data space. We introduce the concept of outliers in the new metric and show that the proposed method of treating outliers asymptotically recovers the data distribution when the data come from a single multivariate Gaussian distribution. Two heuristic algorithms are presented that attempt to optimize the new metric. In a series of empirical studies on real and simulated data sets, we show that volume-based clustering outperforms k-means clustering.
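The objective described in the abstract can be illustrated with a small sketch. This is not the authors' method: the paper's two heuristics are not reproduced here. The sketch only evaluates the cost of a given partition, approximating each cluster's MVE volume with Khachiyan's first-order algorithm. The function names (`mve_volume`, `total_volume`), the tolerance, and the iteration cap are illustrative assumptions, and each cluster is assumed to contain at least d + 1 affinely independent points.

```python
import math

def solve(a, b):
    """Solve a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(a)
    m = [row[:] + [bi] for row, bi in zip(a, b)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[piv] = m[piv], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [x - f * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]

def det(a):
    """Determinant by Gaussian elimination with partial pivoting."""
    n = len(a)
    m = [row[:] for row in a]
    d = 1.0
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(m[r][col]))
        if m[piv][col] == 0.0:
            return 0.0
        if piv != col:
            m[col], m[piv] = m[piv], m[col]
            d = -d
        d *= m[col][col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            m[r] = [x - f * y for x, y in zip(m[r], m[col])]
    return d

def weighted_scatter(vectors, u):
    """Return sum_i u_i * v_i v_i^T as a list of rows."""
    k = len(vectors[0])
    return [[sum(w * v[i] * v[j] for v, w in zip(vectors, u))
             for j in range(k)] for i in range(k)]

def mve_volume(points, tol=1e-7, max_iter=10000):
    """Approximate MVE volume of a point set (Khachiyan's algorithm).

    Assumption: the points affinely span the space. The result is an
    approximation that approaches the true volume from below.
    """
    n, d = len(points), len(points[0])
    q = [list(p) + [1.0] for p in points]   # lift points to dimension d + 1
    u = [1.0 / n] * n                       # weights on the points
    for _ in range(max_iter):
        x = weighted_scatter(q, u)
        m = [sum(a * b for a, b in zip(qi, solve(x, qi))) for qi in q]
        j = max(range(n), key=m.__getitem__)
        step = (m[j] - d - 1.0) / ((d + 1.0) * (m[j] - 1.0))
        if step < tol:                      # weights are (nearly) optimal
            break
        u = [(1.0 - step) * w for w in u]
        u[j] += step
    c = [sum(w * p[i] for p, w in zip(points, u)) for i in range(d)]
    s = weighted_scatter(points, u)
    # Ellipsoid {x : (x - c)^T shape^{-1} (x - c) <= 1}
    shape = [[d * (s[i][j] - c[i] * c[j]) for j in range(d)] for i in range(d)]
    unit_ball = math.pi ** (d / 2) / math.gamma(d / 2 + 1)
    return unit_ball * math.sqrt(max(det(shape), 0.0))

def total_volume(clusters):
    """Clustering cost: sum of the MVE volumes of the clusters."""
    return sum(mve_volume(c) for c in clusters)

square = [(-1.0, -1.0), (-1.0, 1.0), (1.0, -1.0), (1.0, 1.0)]
# Affine map x -> Mx + b with M = [[3, 1], [0, 2]], so |det M| = 6.
moved = [(3 * x + y + 5, 2 * y - 7) for x, y in square]
print(mve_volume(square))   # approximately 2*pi (circle of radius sqrt(2))
print(mve_volume(moved))    # approximately 6 * 2*pi
```

The example also shows why the metric is invariant: an affine map x -> Mx + b multiplies the volume of every ellipsoid by |det M|, so the total volume of every candidate partition is rescaled by the same factor and the ranking of partitions does not change.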

Similar resources

A Scale Invariant Clustering using Minimum Volume Ellipsoids

This paper develops theory and algorithms concerning a new metric for clustering data. The metric minimizes the total volume of clusters, where the volume of a cluster is defined as the volume of the minimum volume ellipsoid (MVE) enclosing all data points in the cluster. This metric has the scale-invariant property, that is, the optimal clusters are invariant under an affine transformation of the ...

On the Minimum Volume Covering Ellipsoid of Ellipsoids

We study the problem of computing a (1 + ε)-approximation to the minimum volume covering ellipsoid of a given set S of the convex hull of m full-dimensional ellipsoids in R^n. We extend the first-order algorithm of Kumar and Yıldırım that computes an approximation to the minimum volume covering ellipsoid of a finite set of points in R^n, which, in turn, is a modification of Khachiyan’s algorithm. F...

Minimum Volume Enclosing Ellipsoids

Two different methods for computing the covering ellipses of a set of points are presented. The first method finds the optimal ellipsoids with the minimum volume. The second method uses the first and second moments of the data points to compute the parameters of an ellipsoid that covers most of the points. MATLAB software was written to verify the results.

On the Minimum Volume Covering Ellipsoid of Ellipsoids

Let S denote the convex hull of m full-dimensional ellipsoids in R^n. Given ε > 0 and δ > 0, we study the problems of computing a (1 + ε)-approximation to the minimum volume covering ellipsoid of S and a (1 + δ)n-rounding of S. We extend the first-order algorithm of Kumar and Yıldırım that computes an approximation to the minimum volume covering ellipsoid of a finite set of points in R^n, which, in ...

Sedimentation of Oblate Ellipsoids at Low and Moderate Reynolds Numbers

In many applications in biophysics and environmental engineering, the sedimentation of non-spherical particles, for example ellipsoids, is an important problem. In our work, we simulate the dynamics of oblate ellipsoids under gravity. We study the settling velocity and the average orientation of the ellipsoids as a function of volume fraction. We see that the settling velocity shows a local maximum...

Journal title:
  • Computers & OR

Volume 35

Publication date: 2008